Pivan: A Web-platform for Document Annotation

نویسندگان

چکیده

The Pivan web platform is an open-source tool for managing different stages of automatic document processing, such as layout analysis, transcription, and named entity recognition. It allows the visualization segmentation, transcription at line or paragraph level, annotation entities. Pivan's web-based nature makes it perfectly suited collaborative offers a smooth experience, even small machines connections. based on up-to-date technologies, includes comprehensive API, can be easily deployed via Docker.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Marky: A Lightweight Web Tracking Tool for Document Annotation

Document annotation is an elementary task in the development of Text Mining applications, notably in defining the entities and relationships that are relevant to a given domain. Many annotation software tools have been implemented. Some are particular to a Text Mining framework while others are typical stand-alone tools. Regardless, most development efforts were driven to basic functionality, i...

متن کامل

An Ensemble Click Model for Web Document Ranking

Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...

متن کامل

Orchestration of Semantic Web Services for Large-Scale Document Annotation

Armadillo is a tool that provides automatic annotation for the Semantic Web using unannotated resources like the existing Web for information harvesting, that is: combining a crawling mechanism with an extensible architecture for ontology population. The latter is achieved via largely unsupervised machine learning, boot-strapped from oracles, such as web-site wrappers. It is backed up by ‘evide...

متن کامل

Extensible Framework of Authoring Tools for Web Document Annotation

Web metadata is crucial for providing machine-understandable descriptions of Web resources, and has a number of applications such as discovery, qualification, and adaptation of Web documents. While metadata is often embedded into a target document, metadata can also be associated externally by means of an addressing scheme such as the XPath language. However, creation and modification of extern...

متن کامل

Web Based Training Systems and Document Annotation – Implementations for Hyperwave

Web Based Training Systems are the result of a number of developments in different disciplines of computer science in recent years. These developments include Computer Aided Instruction, Hypertext and Hypermedia, and the World Wide Web. This thesis describes these developments and how they are brought together in Web Based Training Systems. It gives an overview over existing Web Based Training ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Archiving

سال: 2023

ISSN: ['2161-8798', '2168-3204']

DOI: https://doi.org/10.2352/issn.2168-3204.2023.20.1.10